Almost Optimal Fully LZW-Compressed Pattern Matching
Abstract
We are given two strings: a pattern P and a text T of lengths |P| = M and |T| = N. The string matching problem is to find all occurrences of the pattern P in the text T. The fully compressed string matching problem is the string matching problem with the input strings P and T given in compressed forms p and t respectively, where |p| = m and |t| = n. We present the first almost optimal string matching algorithms for LZW-compressed strings, running in: 1. O((n+m) log(n+m)) time on a single-processor machine, and 2. Õ(n+m) work on an (n+m)-processor PRAM. The techniques used in our paper can be used in the design of efficient algorithms for a wide range of the most typical string problems in the compressed LZW setting, including: computing a period of a word, finding repetitions, symmetries, counting subwords, and multi-pattern matching.
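To make the problem statement concrete, the following minimal sketch shows what the compressed inputs p and t look like under textbook LZW, together with a naive baseline that decompresses both strings and then searches. This baseline is not the algorithm of the paper: it needs time at least proportional to N + M, which is exactly the cost a fully compressed algorithm avoids. The function names and the explicit `alphabet` parameter are assumptions made for illustration.

```python
def lzw_compress(s, alphabet):
    """Encode s as a list of LZW codes; the initial table holds the alphabet."""
    table = {ch: i for i, ch in enumerate(alphabet)}
    codes = []
    w = ""
    for ch in s:
        if w + ch in table:
            w += ch                        # extend the current phrase
        else:
            codes.append(table[w])         # emit the longest known phrase
            table[w + ch] = len(table)     # register the new phrase
            w = ch
    if w:
        codes.append(table[w])
    return codes


def lzw_decompress(codes, alphabet):
    """Invert lzw_compress by rebuilding the phrase table on the fly."""
    if not codes:
        return ""
    table = {i: ch for i, ch in enumerate(alphabet)}
    prev = table[codes[0]]
    out = [prev]
    for code in codes[1:]:
        if code in table:
            entry = table[code]
        else:
            # The code refers to the phrase that is being defined right now.
            entry = prev + prev[0]
        out.append(entry)
        table[len(table)] = prev + entry[0]
        prev = entry
    return "".join(out)


def naive_compressed_match(p_codes, t_codes, alphabet):
    """Baseline only: decompress both inputs, then list all occurrences."""
    pattern = lzw_decompress(p_codes, alphabet)
    text = lzw_decompress(t_codes, alphabet)
    hits = []
    start = text.find(pattern)
    while start != -1:
        hits.append(start)
        start = text.find(pattern, start + 1)   # allow overlapping matches
    return hits


t = lzw_compress("abababab", "ab")   # [0, 1, 2, 4, 1], so n = 5 while N = 8
p = lzw_compress("aba", "ab")        # [0, 1, 0]
print(naive_compressed_match(p, t, "ab"))   # [0, 2, 4]
```

Here the text of length N = 8 is represented by n = 5 codes. For highly repetitive inputs the gap between N and n grows much larger, which is why running times stated in terms of n + m rather than N + M matter.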
Similar resources
Tying up the loose ends in fully LZW-compressed pattern matching
We consider a natural generalization of the classical pattern matching problem: given compressed representations of a pattern p[1..M] and a text t[1..N] of sizes m and n, respectively, does p occur in t? We develop an optimal linear time solution for the case when p and t are compressed using the LZW method. This improves the previously known O((n + m) log(n + m)) time solution of Gąsien...
A Unifying Framework for Compressed Pattern Matching
We introduce a general framework which is suitable to capture the essence of compressed pattern matching according to various dictionary-based compression methods. The goal is to find all occurrences of a pattern in a text without decompression, which is one of the most active topics in string matching. Our framework includes such compression methods as the Lempel-Ziv family (LZ77, LZSS, LZ78, LZW), byte-...
Shift-And Approach to Pattern Matching in LZW Compressed Text
This paper considers the Shift-And approach to the problem of pattern matching in LZW compressed text, and gives a new algorithm that solves it. The algorithm is indeed fast when a pattern length is at most 32, or the word length. After an O(m + |Σ|) time and O(|Σ|) space preprocessing of a pattern, it scans an LZW compressed text in O(n + r) time and reports all occurrences of the pattern, whe...
A Fully Compressed Pattern Matching Algorithm for Simple Collage Systems
We study the fully compressed pattern matching problem (FCPM problem): Given T and P which are descriptions of text T and pattern P respectively, find the occurrences of P in T without decompressing T or P. This problem is rather challenging since patterns are also given in a compressed form. In this paper we present an FCPM algorithm for simple collage systems. Collage systems are a general fr...
Beating O(nm) in approximate LZW-compressed pattern matching
Given an LZW/LZ78 compressed text, we want to find an approximate occurrence of a given pattern of length m. The goal is to achieve time complexity depending on the size n of the compressed representation of the text instead of its length. We consider two specific definitions of approximate matching, namely the Hamming distance and the edit distance, and show how to achieve O(n √ mk) and O(n √ ...
Publication date: 1999